Blar i NTNU Open på forfatter "Johnsen, Magne Hallstein"

A Framework for Speech Recognition using Logistic Regression

Birkenes, Øystein (Doktoravhandlinger ved NTNU, 1503-8181; 2007:165, Doctoral thesis, 2007)

Although discriminative approaches like the support vector machine or logistic regression have had great success in many pattern recognition application, they have only achieved limited success in speech recognition. Two ...

Analyse av Ulike Akustiske Taleegenskaper for Deteksjon av Artikulatoriske Attributter

Fasseland, Claus (Master thesis, 2007)

Arbeidet bak denne rapporten har omhandlet analysering av egendefinerte taleegenskapers potensial ved detektering av lydenheter (attributter). Et HTK-basert simuleringsverktøy har blitt utviklet for å kunne kartlegge ...

Audiovisual Contents Segmentation

Sundøy, Kristoffer Johan (Master thesis, 2010)

The objective of this thesis is to detect high level semantic ideas to help to impose a structure on television talk shows. Indexing TV-shows is a subject that, to our knowledge, is rarely talked about in the scientific ...

Brukerforsøk med multimodal demonstrator

Aas, Asbjørn (Master thesis, 2006)

Med en multimodal applikasjon utviklet ved Telenor FoU dokumenteres forskjellige brukergruppers nytte av multimodale systemer.

Computer Assisted Pronunciation Training: Evaluation of non-native vowel length pronunciation

Versvik, Eivind (Master thesis, 2009)

Computer Assisted Pronunciation Training systems have become popular tools to train on second languages. Many second language learners prefer to train on pronunciation in a stress free environment with no other listeners. ...

Evaluating Vowel Pronunciation in Computer Assisted Pronunciation Training.

Erichsen, Stian (Master thesis, 2011)

Computer Assisted Pronunciation Training (CAPT) applications are tools that can be used when learning a second language. By evaluating the speech of a student, the CAPT system is able to give automatic feedback on his or ...

Implementation and Adaptation of a System for Automatic Classification of Birdsong.

Selboe, Kristian (Master thesis, 2015)

The background for this master thesis is a collaboration between Able Magic and NTNU. At the request of Able Magic there has been conducted a feasibility study by NTNU, which has led to the development of a bird classification ...

Marvina – A Norwegian Speech Centric, Multimodal Visitors’ Guide

Hartvigsen, Ole; Harborg, Erik; Amble, Tore; Johnsen, Magne Hallstein (Lecture, 2007)

This paper describes the development and testing of a multimodal visitors’ guide service for guests to the city and university in Trondheim. The system is under continuous development. At the present ...

Modeling and Confidence in a System for Automatic Classification of Birdsong

Aagaard, Fredrik Fløttum (Master thesis, 2015)

It turns out that using a two-state-HMM model structure with appurtenant GMM-based state distributions improves the system performance compared to the use of just GMMs as model structure for each bird specie. Hence, it is ...

Modeling and monitoring cerebral blood flow in premature infants with patent ductus arteriosus using ultrasound Doppler technique

Tran, Alex (Master thesis, 2018)

The aim of this thesis was to simulate cerebral bloodflow of premature infants with patent ductus arteriosus, using a lumped model. A linear Windkessel-3 model with constant peripheral resistance, and a non-linear Windkessel-3 ...

Modeling of Peripheral Resistance in the Microvasculature for Diabetic Patients with Ultrasound Doppler Technique

Wisløff, Anna Karoline (Master thesis, 2018)

Diabetes is a growing health problem worldwide and involves long-term complications. Microangiopathy is one of the main issues associated with diabetes and is a condition that is difficult to detect. The aim of this project ...

Speech Enhancement with Deep Neural Networks

Melve, Olav Klungsøyr (Master thesis, 2016)

This master thesis describes the implementation and evaluation of a promising approach to speech enhancement based on deep neural networks. A baseline system was imple- mented and trained using noisy data synthesized by ...

Transient Noise Event Detection in Recorded Speech

Dahl, Jørund Kaarstad (Master thesis, 2013)

Transient Noise Event Detection in Speech is the ability to detect a transient noise event like loud knocks, crumpling of paper, and other impulsive sounds in recorded speech.Transient Event Detection is a relevant issue ...

A Two-Stage Deep Modeling Approach to Articulatory Inversion

Sabzi Shahrebabaki, Abdolreza; Olfati, Negar; Imran, Ali Shariq; Johnsen, Magne Hallstein; Siniscalchi, Sabato Marco; Svendsen, Torbjørn Karl (Chapter, 2021)

This paper proposes a two-stage deep feed-forward neural network (DNN) to tackle the acoustic-to-articulatory inversion (AAI) problem. DNNs are a viable solution for the AAI task, but the temporal continuity of the estimated ...

Using neural networks for the detection of cracking sounds during coffee roasting - Data collection, annotation, development and evaluation

Arbo, Einar Wigum (Master thesis, 2018)

During the work on this thesis, 9 hours and 55 minutes of coffee roasting was collected by the use of several different modern cellphones. The recordings obtained during the work on this thesis, as well as an additional 3 ...

Volume flow estimation of blood flow using speckle tracking

Neteland, Silje (Master thesis, 2013)

A simulation of a carotid artery with no tilt and an artery with a 20 degrees tilt is performed in order to test the performance of a multi-dimensional flow velocity estimation technique based on speckle tracking. The ...